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Abstract 

We prove a complexity dichotomy theorem for all non-negative weighted counting Constraint Satisfaction 
Problems (CSP). This caps a long series of important results on counting problems including unweighted and 
weighted graph homomorphisms |fT9l [8l [T8l[T2l and the celebrated dichotomy theorem for unweighted #CSP ||6l 
|4]|2Tl|22l. Our dichotomy theorem gives a succinct criterion for tractability. If a set T of constraint functions 
satisfies the criterion, then the counting CSP problem defined by T is solvable in polynomial time; if it does not 
satisfy the criterion, then the problem is #P-hard. We furthermore show that the question of whether T satisfies 
the criterion is decidable in NP. 

Surprisingly, our tractability criterion is simpler than the previous criteria for the more restricted classes of 
problems, although when specialized to those cases, they are logically equivalent. Our proof mainly uses Linear 
Algebra, and represents a departure from Universal Algebra, the dominant methodology in recent years. 



1 Introduction 



The study of Constraint Satisfaction Problems (CSP) has been one of the most active research areas, where enor- 
mous progress has been made in recent years. The investigation of CSP includes at least the following major bran- 
ches: Decision Problems — whether a solution exists |[36ll27l [3ll32l: Optimization Problems — finding a solution 
that satisfies the most constraints (or in the weighted case achieving the highest total weight) |[26l[3n in [T7l[34l[38l 
[35l : and Counting Problems — to count the number of solutions, including its weighted version |[6ll4l[T0H7ll2TI. 
The decision CSP dichotomy conjecture of Feder and Vardi 11231 . that every decision CSP problem defined by a 
constraint language T is either in P or NP-complete, remains open. A great deal of work has been devoted to the 
optimization version of CSP, constituting a significant fraction of on-going activities in approximation algorithms. 

The subject of this paper is on counting CSP; more precisely on weighted counting Constraint Satisfaction Pro- 
blems, denoted as weighted #CSP. For unweighted #CSP, the problem is usually stated as follows: D is a fixed finite 
set called the domain set. A fixed finite set of constraint predicates F = . . . , @h} is given, where each 0j is a 
relation on D*"' of some finite arity r^. Then an instance of #CSP(r) consists of a finite set of variables xi, . . . , x„, 
ranging over D, and a finite set of constraints from F, each applied to a subset of these variables. It defines a new 
n-ary relation R where {xi, . . . ,Xn) G ^ if and only if all the constraints are satisfied. The #CSP problem then 
asks for the size of R. In a (non-negatively) weighted #CSP, the set F is replaced by a fixed finite set of constraint 
functions, T = {/i, . . . , fh}, where each /j maps D^' to non-negative reals M4.. An instance of #CSP{T) consists 
of variables xi, . . . , Xn, ranging over D, and a finite set of constraint functions from T, each applied to a subset of 
these variables. It defines a new n-ary function F: for any assignment (xi, . . . , x^), F{xi, . . . , x„) is the product 
of the constraint function evaluations. The output is then the so-called partition function, that is, the sum of F over 
all assignments {xi, . . . , x„} D. The unweighted #CSP is the special case where each constraint function is 
0-1 valued. (A formal definition will be given in Section |2l) 

Regarding unweighted #CSP, Bulatov El proved a sweeping dichotomy theorem. He gave a criterion, congru- 
ence singularity, and showed that for any finite set of constraint predicates F over any finite domain D, if F satisfies 
this condition, then #CSP(F) is solvable in P; otherwise it is #P-complete. His proof uses deep structural theorems 
from universal algebra |[mi28ll24l . Indeed this approach using universal algebra has been one of the most exciting 
developments in the study of the complexity of CSP in recent yeai^s, first used in decision CSP Il29ll30l l3ll2l. and 
has been called the Algebraic Approach. 

However, this is not the only approach. In II2TI . Dyer and Richerby gave an alternative proof of the dichotomy 
theorem for unweighted #CSP. Their proof is considerably more direct, and uses no universal algebra other than the 
notion of a Mal'tsev polymorphism. They also showed that the dichotomy is decidable Il20ll22l . Furthermore, by 
treating rational weights as integral multiples of a common denominator, the dichotomy theorem can be extended 
to include positive rational weights 171. 

In this paper, we give a complexity dichotomy theorem for all non-negative weighted #CSP(J^). To describe 
our approach, let us first briefly recap the proofs by Bulatov and by Dyer and Richerby. Bulatov's proof is deeply 
embedded in a structural theory of universal algebra called tame congruence theory |[28l . (A congruence is an 
equivalence relation expressible in a given universal algebra.) The starting point of this Algebraic Approach is the 
realization of a close connection between unweighted #CSP(F) and the relational clone (F) generated by F. (F) 
is the closure set of all relations expressible from F by boolean conjunction A and the existential quantifier 3. A 
basic property, called congruence permutability, is then shown to be a necessary condition for the tractability of 
#CSP(F) ||9l|6l[l0l. It is known from universal algebra that congruence permutability is equivalent to the existence 
of Mal'tsev polymorphisms. It is also equivalent to the more combinatorial condition of strong rectangularity of 
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Dyer and Richerby |[2TI : For any n-aiy relation R defined by an instance of #CSP(r), if we partition its n variables 
into three parts: u = (ui, . . . , uj^),^ = (f i, . . . , f^) and w = {wi, . . . , Wn-k-i)^ then the following \D\^ x \D\^ 
matrix M must be block-diagonal after separately permuting its rows and columns: M (u, v) = 1 if there exists a 
w such that (u, v, w) € R; and Af (u, v) = otherwise. (See the formal definition in Section|2l) 

Assuming T satisfies this necessary condition (otherwise #CSP(r) is already #P-hard), Bulatov's proof delves 
much more deeply than Mal'tsev polymorphisms and uses a lot more results and techniques from universal algebra. 
The Dyer-Richerby proof manages to avoid much of universal algebra. They went on to give a more combinatorial 
criterion, called strong balance: For any n-ary relation R defined by an instance of #CSP(r), if we partition its n 
variables into four parts: u = (m, . . . v = {vi, . . . ,Vi),\v = {wi, . . . ,wt),z = {zi, . . .,Zn-k-e-t), then 
the following \D\^ x \Df integer matrix M must be block-diagonal and all of its blocks are of rank 1 (which we 
will refer to as a block-rank- 1 matrix): 



(See the formal definition in Section |9l) Dyer and Richerby |[2]1 show that strong balance (which implies strong 
rectangulaiity) is the criterion for the tractability of #CSP(r). They further prove that it is equivalent to Bulatov's 
criterion of congruence singularity which is stated in the language of universal algebra. 

The first difficulty we encountered when trying to extend the unweighted dichotomy to weighted #CSP(J-") is 
that there is no direct extension of the notion of strong balance above in the weighted world. While the number 
of satisfying R on the right side of ([TJ can be naturally replaced by the sum of F (any function defined by an 
#CSP(J^) instance) over w, we do not see any easy way to introduce existential quantifiers to this more general 
weighted setting. Moreover, the use of existential quantifiers in the notion of strong balance is crucial to the proof 
of Dyer and Richerby: their polynomial-time counting algorithm for tractable #CSP(r) heavily relies on them. 

While there seems to be no natural notion of an existential quantifier in the weighted setting, we came to a key 
observation that the notion of strong balance is equivalent to the one without using any existential quantifiers (that 
is, we only consider partitions of the variables into 3 parts with no z). We include the proof of this equivalence 
in Section |9l This inspires us to use the following seemingly weaker notion of balance for weighted #CSP(J^), 
with no existential quantifiers at all: For any n-ary function F defined by a #CSP(J^) instance, if we partition its 
n variables into three parts: u = (ui, . . . , n^), v = {vi, . . . ^vi) and w = {wi, . . . , Wn-k-e)^ then the following 
iDl'' X |DK matrix M must be block-rank- 1: 



It is easy to show that balance is a necessary condition for the tractability of #CSP(-F). But is it also sufficient? 
If F is balanced, can we solve it in polynomial time? We show that this is indeed the case by giving a polynomial 
time counting scheme for all #CSP(J-')s with F being balanced. Our algorithm works differently from the one of 
Dyer and Richerby. It avoids the use of existential quantifiers and is designed specially for weighted and balanced 
#CSP(J')s. As a result, we get the following dichotomy for non-negatively weighted #CSP with a logically simpler 
criterion: 

Theorem 1 (Main). #CSP{F) is in polynomial-time if F is balanced; and is ffP-hard otherwise. 

A new ingredient of our proof is the concept of a vector representation for a non-negative function. Let F be a 
function over xi, . . . , x^- Then si, . . . , Sn '■ D ^ IR+ is a vector representation of F if for any (xi, . . . , Xn) € 



M(u, v) = |w : 3 z such that (u, v, w^, z 



) eR] , for all u G L''^ and V G D^. 



(1) 




F(u, V, w), for all u G and v G D^. 
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such that F{xi, . . . , > 0, we have F{xi, . . . , Xn) = si{xi) ■ ■ ■ s„(rE„). The first step of our algorithm is to 
show that given any instance of #CSP(J-'), where T is balanced, the function it defines has a vector representation 
which can be computed in polynomial time. However, F may have a lot of "holes" where si(xi) • • • Sn{xn) > 
but F{xi , . . . , Xn) = so it is still not clear how to do the sum of F over xi , . . . , x„. 

The next step is quite a surprise. Assuming T is balanced, we show how to construct one-variable functions 
t2, ■ ■ ■ ,tn D ^ in polynomial time such that for any {ui, . . . , n„) G with F{ui, . . . , n„) > 0, we have 

J] F(ni,X2,...,x„) = .i(ni)-n?r4- (2) 

The intriguing part of Q is that its left side only depends on ui but it holds for any (ui, . . . , u„) € as long 
as F{ui, . . . , Un) > 0. A crucial ingredient we use in constructing t2, ■ ■ ■ ,tn and proving Q here is the succinct 
data structure called frame introduced by Dyer and Richerby for unweighted #CSP |[2TI (which is similar to the 
"compact representation" of Bulatov and Dalmau IH). Once we have t2, ■ ■ ■ ,tn and Q, computing the partition 
function becomes trivial. 

After obtaining the dichotomy, we also show in Section [6] that the tractability criterion (that is, whether T is 
balanced or not) is decidable in NP. The proof follows the approach of Dyer and Richerby EOl for unweighted 
#CSP, with new ideas and constructions developed for the weighted setting. 

This advance, from unweighted to weighted #CSP, is akin to the leap from the Dyer-Greenhill result on count- 
ing 0-1 graph homomorphisms 091 to the Bulatov-Grohe result for the non-negative case [8|. The Bulatov-Grohe 
result paved the way for all future developments. This is because not only the Bulatov-Grohe result is intrinsically 
important and sweeping but also they gave an elegant dichotomy criterion, which allows its easy application. Al- 
most all future results in this ai^ea use the Bulatov-Grohe criterion. Here our result covers all non-negative counting 
CSP. It achieves a similar leap from the 0-1 case of Bulatov and Dyer-Richerby, and in the meanwhile, simplifies 
the dichotomy criterion. Therefore it is hoped that it will also be useful for future research. 

In hindsight, perhaps one may re-evaluate the Algebraic Approach. We now know that there is another Alge- 
braic Approach, based primarily on matrix algebra rather than (relational) universal algebra, which gives us a 
more direct and complete dichotomy theorem for #CSPs. It is perhaps also a case where the proper generalization, 
namely weighted #CSP, leads to a simpler resolution of the problem than the original unweighted #CSP. 

Weighted #CSP has many special cases that have been studied intensively. Graph homomorphisms can be 
considered as a special case of weighted #CSP where there is only one binary constraint function. There has been 
great advances made on graph homomorphisms |[T9l [8l [T8l[T2l . Our dichotomy theorem generalizes all previous 
dichotomy theorems where the constraint functions are non-negative. Looking beyond non-negatively weighted 
counting type problems, in graph homomorphisms |[25l [T3l [37l great progress has already been made. To extend 
that to #CSPs with real or even complex weights will require significantly more effort (even for directed graph 
homomorphisms llT2l ). For Boolean #CSP with complex weights, a dichotomy was obtained ifTSl . Going beyond 
CSP type problems, holographic algorithms and reductions ai^e aimed precisely at these counting problems where 
cancelation is the main feature. The work on Holant problems and their dichotomy theorems are the beginning 
steps in that direction ifBlfTelflH . 
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2 Preliminaries 



We start with some definitions about non-negative matrices. 

Let M be a non-negative m x n matrix. We say M is rectangular if one can permute its rows and columns 
separately, so that M becomes a block-diagonal matrix. More exactly, M is rectangular if there exist s pairwise 
disjoint and nonempty subsets of [m], denoted hy Ai, . . . , Ag, and s pairwise disjoint and nonempty subsets of [n], 
denoted by . . . , -B^. for some s > 0, such that for all i € [m] and j G [n], 

M{i,j) > <J=^ i ^ Ak and j G Bj^ for some A; G [s]. 

Now let M be a non-negative and rectangular m x n matrix with s blocks Ai x Bi, . . . , Ag x Bg. We say it 
is block-rank-1 if the A^ x Bk sub-matrix of M, for every G [s], is of rank 1. 

The two lemmas below then follow directly from the definition of block-rank- 1 matrices: 

Lemma 1. Let M. be a block-rank-1 matrix with s > 1 blocks: Ai x Bi, . . . , Ag x Bg. If i* G A^ and j* G B^ 
for some k G [s], then for any i ^ A^we have 

EjeB,M{i,j) _ MjiJ*) 
EjeB,Mii*,j) M{i*,fy 

Lemma 2. IfWl is a non-negative matrix but is not block-rank-1, then there exist two rows o/M that are neither 
linearly dependent nor orthogonal. 

2.1 Basic #P-Hardness About Counting Graph Homomorphisms 

Every symmetric and non-negative n x n matrix A defines a graph homomorphism (or partition) function ^a(') 
as follows: Given any undirected graph G = {V, E), we have 

We need the following important result of Bulatov and Grohe ||8l to derive the hardness part of our dichotomy: 

Theorem 2. Let A be a symmetric and non-negative matrix with algebraic entries, then the problem of computing 
Za(-) is in polynomial time if A is block-rank-1; and is #P-hard otherwise. 

2.2 Weighted #CSPs 

Let D = {1,2, . . . ,d}be the domain set, where the size d will be considered as a constant. A weighted constraint 
language T over the domain Z? is a finite set of functions {/i , . . . , fh} in which fi : D^^ ^ M is an rj-ary function 
over D for some > 1. The arity ri of fi,i G [h], the number of functions h in T, as well as the values of fi, will 
all be considered as constants (except in Section [6] where the decidability of the dichotomy is discussed). In this 
paper, we only consider non-negative weighted constraint languages in which every fi maps D^^ to non-negative 
and algebraic numbers. 
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The pair (L), F) defines the following problem which we simply denote by {D, T): 

1. Let X = (xi, . . . , Xn) € D"^ be a set of n variables over D. The input is then a collection I of m tuples 
{f,ii, ... ,ir) in which / is an r-ary function in T and ii, . . . ,ir S [n]. We call n + m the size of /. 

2. The input / defines the following function F/ over x = (xi, . . . , x„) € D"". 

F/(x) = Yi f{xh,- ■ ■ , Xi,), for every x G 
{f,ii,-,ir)ei 

And the output of the problem is the following sum: 

z{i) y: ^k-)- 

xG-D" 

2.3 Reduction from Unweighted to Weighted #CSPs 

A special case is when every function in the language is boolean. In this case, we can view each of the functions 
as a relation. We use the following notation for this special case. 

An unweighted constraint language T over the domain set D is a finite set of relations {Gi, . . . , Qh} in which 
every G/ is an r^-ary relation over D'''^ for some r-i > 1. The language T defines the following problem which we 
denote by {D,T): 

1. Let X = (xi, . . . , Xn) € -D" be a set of n variables over D. The input is then a collection I of m tuples 
(G, ii, ... ,ir) in which G is an r-ary relation in F and ii, . . . ,ir € [n]. We call n + m the size of /. 

2. The input / defines the following relation Rj over x = (xi, . . . , x„) G D^: 

X € i?/ <:=^ for every tuple (G, ii , . . . , ) € /, we have (xj^ , . . . , Xj^ ) € G. 
And the output of the problem is the number of x G D"- in the relation Rj. 

For any non-negative weighted constraint language T = {/i, . . . , fh}, it is natural to define its corresponding 
unweighted constraint language F = {Gi, . . . , G/j}, where x G Gj if and only if /i(x) > 0, for all i e [h] and 
X € D^'k In Section|7J we give a polynomial-time reduction from (D, F) to {D, T). 

Lemma 3. Problem (D, F) is polynomial-time reducible to {D, T). 
Corollary 1. If (D, T) is not #P-hard, then neither is {D, F). 

2.4 Strong Rectangularity 

In the proof of the complexity dichotomy theorem for unweighted #CSPs ||4l|2Tl, an important necessary condition 
for {D, F) being not #P-hard is strong rectangularity: 
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Definition 1 (Strong Rectangulaiity). We say T is strongly rectangular if for any input I of {D, T) (which defines 
an n-ary relation Rj over (xi, . . . , x„) € L)") and for any integers a,b : 1 < a < b < n, the following x d^~°' 
matrix M is rectangular; the rows o/M are indexed by u ^ D"^ and the columns are indexed by v G D^~°', and 

M(u, v) = {we -D""^ : (u, V, w) € i?/} , for all u£ D^andv e D^"". 

For the special case when b = n, we have M{u, v) = 1 // (u, v) € Ri and M(u, v) = otherwise. 

The following theorem can be found in H and ll2ll : 
Theorem 3. IfT is not strongly rectangular, then (D, T) is #P-hard. 

As a result, if {D, F) is not #P-hard, then T must be strongly rectangular by Corollary [T] and Theorem[3l where 
r is the unweighted language that corresponds to F. The strong rectangularity of F then gives us the following 
algorithmic results from |[20i . using the succinct and efficiently computable data structure called frame. They turn 
out to be very useful later in the study of the original weighted problem {D,T). We start with some notation. 

Let / be an input instance of (D, T) which defines a relation R over n variables x = {xi, . . . , Xn)- 

Definition 2. For any i G [n], we use pr^R C D to denote the projection of R on the ith coordinate: a G prji? 
and only if there exist tuples u G D^~^ and v G Z)"~* such that (u, o, v) G R. 

We define the following relation ~i on pr^R: a b if there exist tuples u G D^^^ and Vq, v;, G Z)"~* such 
that (u, a, Va) G R and (u, b, v^) G R. 

Lemma 4 (" 11201 ). IfT is strongly rectangular then given any input I of{D, T) which defines a relation R, we have 

(A) . For any i G [n], we can compute the set pr^i? in polynomial time in the size of I. Moreover, for every 

a G pr^R, we can find a tuple u € R such that Ui = a in polynomial time. 

(B) . For any i G [n], the relation ~j tnust be an equivalence relation and can be computed in polynomial 

time. We will use Si^k D, k = 1,2, to denote the equivalent classes of^i. 

(C) . For any equivalence class £i^k> we can find, in polynomial time, a tuple u'*'*^] G as well as a tuple 

^li,k,a\ g D"^'"^ for each element a G Si^k such that (u'*'*^], a, v'*''"'"!) G R for all a G Si^k- 

As a corollary, if {D, F) is not #P-hard, then we are able to use all the algorithmic results above for {D, T) as 
subroutines, in the quest of finding a polynomial-time algorithm for {D,F). 



3 A Dichotomy for Non-negative Weighted #CSPs and its Decidability 

In this section, we prove a dichotomy theorem for all non-negative weighted #CSPs and show that the chai^acteri- 
zation can be checked in NP. The lemmas used in the proofs will be proved in the rest of the paper. 

In the proof of our dichotomy theorem as well as its decidability, the following two notions of weak balance 
and balance play a crucial role. It is similar to and, in some sense, weaker than the concept of strong balance used 
in |[20l . (Notably we do not use any existential quantifier in the definitions.) 
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Definition 3 (Weak Balance). We say T is weakly balanced if for any input instance I of {D, T) (which defines a 
non-negative function F{xi, . . . , Xn) over D) and for any integer a : 1 < a < n, the following d'^ x d matrix M 
is block-rank- 1; the rows o/M are indexed by \i ^ D"" and the columns are indexed by v € D, and 

M{u,v) = ^ F{u,v,w), for all u e and V e D. 
For the special case when a + 1 = n, we have M(u, v) = F{u, v) is block-rank-1. 

Definition 4 (Balance). We call T balanced if for any input instance I of (D, F) {which defines a non-negative 
function F{xi, . . . , x„) over D) and for any integers a,b : I < a < b < n, the following d"" x matrix M is 
block-rank-1; the rows ofM. are indexed by u ^ D"" and the columns are indexed by v £ D^""", and 

M(u, v) = ^ F(u, V, w) , for all u € -D" W v G D^"". 

For the special case when h = n, we have M(u, v) = -F(u, v) is block-rank-1. 

It is clear that balance implies weak balance. We prove the following complexity dichotomy theorem. 

Tlieorem 4. (D, F) is in PifV is strongly rectangular and F is weakly balanced; and is #P-hard otherwise. 

Proof. Assume {D, F) is not #P-hard. By Corollary [T] and Theorem[3l V must be strongly rectangular. We prove 
the following lemma in Section [H showing that F must be balanced and thus, weakly balanced: 

Lemma S.IfF is not balanced, then (D, F) is #P-hard. 

In the next two sections (Sections |4] and |5]l we focus on the proof of the following algorithmic lemma: 
Lemma 6.IfT is strongly rectangular and F is weakly balanced, then (D, F) is in polynomial time. 

The dichotomy theorem then follows directly. □ 

While the characterization of the dichotomy in Theorem |4] above is very useful in the proof of its decidability, 
we can easily simplify it without using strong rectangularity. We prove the following equivalent characterization 
using the notion of balance: 

Lemma 7. [D, F) is in polynomial time if F is balanced; and is ffP-hard otherwise. 

Proof. Assume {D, F) is not #P-hai-d; otherwise we ai^e already done. By Lemma[5l we know F must be balanced. 
By Theorem |4l it suffices to show that if F is balanced, then F is strongly rectangular, where we use T to denote 
the unweighted constraint language that corresponds to F. This follows directly from the definitions of strong 
rectangularity and balance, since a matrix that is block-rank-1 must first be rectangular. □ 

Next, we show that the complexity dichotomy is efficiently decidable. Given D and F, the decision problem 
of whether [D, F) is in P or #P-hard is actually in NP. (Note that here D and F = {/i, . . . , fh} are considered no 
longer as constants, but as the input of the decision problem. The input size is d plus the number of bits needed to 
describe /i, . . . , fh.) We prove the following theorem in Section [6] The proof follows the approach of Dyer and 
Richerby ll20l . with new ideas and constructions developed for the more general weighted case. It uses a method 
of Lovasz |[33l . which was also used in liTSl . 
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Theorem 5. Given D and T, the problem of deciding whether (D, T) is in P or #P-hard is in NP. 

4 Vector Representation 

Assume T is weakly balanced, and let / be an r-ary function in T. We use to denote the corresponding r-ary 
relation of / in T. In this section, we show that there must exist r non-negative one- variable functions si, . . . , '■ 
D — > ]R_|_, such that for all x G D'', either x ^ and /(x) = 0; or we have /(x) = si(xi) • • • Sr{xr)- We call any 
fi = (si, . . . , Sr ) that satisfies the property above a vector representation of /. We prove the following lemma: 

Lemma S.IfJ- is weakly balanced, then every function f (z J- has a vector representation. 

To this end we need the following notation. Let / be any r-ary function over D. Then for any £ G [r], we use 
/[^l to denote the following ^-ary function over D: 

f^^\xi,...,Xi) = ^ f{xi,...,xe,xe+i,...,Xr), for all xi, . . . , X£ G D. 

Xi^l,...,Xr£D 

In particular, we have /t^'l = /. 

Let / be an r-ary non-negative function with r > 1. We say / is block-rank-1 if either r = 1; or the following 
d^~^ X d matrix M is block-rank-1: the rows of M are indexed by u G D^~^ and the columns are indexed by v 
G D, and M(u, v) = /(u, v) for all u G D''^^ and v G D. 

By the definition of weak balance, Lemma[8]is a direct corollary of the following lemma: 

Lemma 9. Let f{xi, . . . ,Xj.) be an r-ary non-negative function. If f^^^ is block-rank-1 for all £ G [r], then f has 
a vector representation fi. 

Proof. We prove the lemma by induction on r, the arity of /. 

The base case when r = 1 is trivial. Now assume for induction that the claim is true for all (r — l)-ary non- 
negative functions, for some r > 2. Let / be an r-ary non-negative function such that /'^l is block-rank-1 for all 
^ G [r] . By definition, it is easy to see that 

(/[^^i])'^^ = /M, foralHG [r-l]. 

As a result, if we denote f^^'~^\ an (r — l)-ary non-negative function, by g, then is block-rank-1 for every £ G 
[r — 1]. Therefore, by the inductive hypothesis, g = /[''"^l has a vector representation (si, . . . , Sr-i). 

Finally, we show how to construct s,. so that (si, . . . , Sr-i, Sr) is a vector representation of /. To this end, we 
let M denote the following d^"^ x d matrix: The rows are indexed by u G D^^^ and the columns are indexed by 
V G D, and M(u, v) = /(u, v) for all u G D^^^ and f G -D. By the assumption we know that M is block-rank-1. 
Therefore, by definition, there exist pairwise disjoint and nonempty subsets of D^~^, denoted by Ai, . . . , As, and 
pairwise disjoint and nonempty subsets of D, denoted by -Bi, . . . , Bs, for some s > 0, such that M(u, v) > if, 
and only if u G and v G Bi for some z G [s]; and for every i G [s], the Ai x Bi sub-matrix of M is of rank 1. 

We now construct Sr : D ^ ]R+ as follows. For every i G [s], we arbitrarily pick a vector from Ai and denote 
it Uj. Then for v G D, we set Sr{v) as follows: 

1. If V ^ Bi for any i G [s], then Sr{v) = 0; and 
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2. Otherwise, assume v ^ Bi. Then 



M{m,,v) 

Sr[v) = = — -. (3) 



To prove that (si, . . . , s,.) is actually a vector representation of /, we only need to show that for every tuple 
(u, v) such that u G and v G Bi for some i £ [s] (since otherwise we have /(u, v) = 0), we have 

f{u,v)=M{u,v) = Sr{v) Y\_ ^ji'^j)- 

iG[r-l] 

By using Lemma [Hand (O, we have 

M{u,v) = M{n,,v) • f^^^"'""^^^ = ..(r;) ■ /'-^'(u) = J] ^^'M' 

where the last equation above follows from the inductive hypothesis that (si, . . . , Sr-i) is a vector representation 
of g = /['■"^l. This finishes the induction, and the lemma is proved. □ 



5 Tractability: The Counting Algorithm 

In this section, we prove Lemma[6]by giving a polynomial-time algorithm for the problem {D, F), assuming F is 
strongly rectangular and F is weakly balanced. As mentioned earlier, because F is strongly rectangular we can use 
the three polynomial-time algorithms described in Lemma|4]as subroutines. Also because F is weakly balanced, 
we may assume, by Lemma[8l that every r-ary function f in F has a vector representation By = (s/^i, . . . , sj^r), 
where Sf^i : D ^ ]R-|_ for all z G [r] . 

Now let / be an input instance of {D, F) and let F denote the function it defines over x = (xi, . . . , Xn) € D^. 
For each tuple in /, one can replace the first component, that is, a function / in F, by its corresponding relation G 
in F. We use /' to denote the new set, which is clearly an input instance of (D, F) and defines a relation R over 
X e D". We have F(x) > if and only if x G i?, for all x G L>". 

The first step of our algorithm is to construct a vector representation fi = (si, . . . , s„) of F, using the vector 
representations Qf of f, f £ F: 

Lemma 10. Given I, one can compute si(-), . . . , Sn{-) in polynomial time such that for all x G D^, either x ^ R 
and -F(x) = 0; or -F(x) = si(xi) • • • Sn(xn). 

Proof. We start with si, . . . , s„ where Sj(a) = 1 for all i G [n] and a £ D. We then enumerate the tuples in / one 
by one. For each {f,ii, . . . ,ir) G / and each j G [r], we update the function Si.{-) using Sfj{-) as follows: 

Si. (a) = Si. (a) • s fj{a), for every a € D. 

It is easy to check that the tuple (si, . . . , Sn) we get is a vector representation of F. □ 
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The second step of the algorithm is to construct a sequence of one-variable functions . . . ,t2(') 

that have the following nice property: for any iG{l,...,n — 1} and for any u G i?, we have 

F{ui,...,Ui,Xi+i,...,Xn) = Si{ui) ■ ■ ■ Si{ui) ■ ... ^n{Un) 

a;i+i,...,a;„eZ) 

Before giving the construction and proving (011, we show that Z{1) is easy to compute once we have tn, ■ ■ ■ ■, ^2- 

For this purpose, we first compute pr^i? in polynomial time using the algorithm in Lemma|4](A). In addition, 
we find a vector Uq = (ua,i, na,2, • • • , Ua^n) £ R for each a G 'pr^R such that = a in polynomial time. Then 



which clearly can be evaluated in polynomial time using si, . . . , s„ and t2, • • • , in- 
Now we construct tn,tn-i, • • • , ^2 and prove (HJl by induction. We start with 

Because T is weakly balanced, the following d"^"^ x d matrix M must be block-rank-1: the rows are indexed 
by u G D^~^ and the columns are indexed by v £ D, and M{\i, v) = F{\i, v) for all u G and v £ D. By 

the definition of we have ~„ V2 if and only if columns vi and t;2 are in the same block of M and thus, the 
equivalent classes {£n,k} are exactly the column index sets of those blocks of M. 

We define as follows. For every a G -D, if o ^ pi'n^ then tn{a) = 0; Otherwise, a belongs to one of the 
equivalence classes £n,k of and 

tn{a) = ^ 777- (5) 

By using the algorithm in Lemma|4](B) t„ (•) can be constructed efficiently. We now prove (01) for i = n — 1. Given 
any u G i?, we have n„ G pr„i? by definition and let £n^k denote the equivalence class that n„ belongs to. Then 

EF(tii,...,n„_i,5) = Yj F{ui,. . . ,Un-i,b) = J| Sj{uj) ^ = JJ Sj{uj) ■ jj^. 

The last equation follows from the construction ([5]) of and the assumption that Un G 

Now assume for induction that we already constructed tj+i, . . . ,tn, for some i G [2 : n — 1], and they satisfy 
(|4]). To construct tj(-), we first observe that the following d*"^ x d matrix M must be block-rank-1, because T is 
weakly balanced: the rows are indexed by u = {ui, . . . , Ui-i) G I?'^^ and the columns are indexed by u G -D, 

M{u,v)= J2 ^(u,^>w)= Y 

weD"-» (u,v,w)e_R 

Similarly, by the definition of ~j, its equivalent classes {£i^k} are precisely the column index sets of those blocks 
of M. By (111) and the inductive hypothesis we immediately have the following concise form for M(u, v): for any 
w = {wi+i, . . . , Wn) G such that (u, -y, w) G R, we have 

M(u,.)= I n ^M^)\s^{v){ n fl^i- 



ViG[j-l] / \ie[j+l:n] 
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Note that by the choice of w can be arbitrary as long as (u, v, w) G R. 
We now construct ti(-). For every a £ D, 

1. If a ^ prj-R, then ti{a) = 0; and 

2. Otherwise, let £"4 ^ denote the equivalence class of ~j that a belongs to. Then by using the algorithm in 
Lemma|4](C), we find a tuple u 

(^^[i,fc]^^^^[i,fc,6]^ e i?, for all h e £-i,fc. 

Then we set 

M(u^^l,a) 
^"E,..,,M(uM,6)- 

By ((61), ti(a) can be computed efficiently using tuples u'*''^'! and v'*'^'^], for h G Ei^k- 

This finishes the construction of tj(-). 

Finally we prove (01). Let u be any tuple in R and Ei^k be the equivalence class of ~j that Uj belongs to. Then 

-F(mi, . . . jUj-ijXj, . . . = ^ ^ F(-ui,...,Ui_i,6,Xi+i,...,a;n). 

Xi,...,Xn&D b&Si^k Xi + i,...,XneD 

Let u* denote the (i — l)-tuple (ni , . . . , nj_i). Then by the definition of M, we can rewrite the sum as 

F{ui, . . . ,Ui^i,Xi,. . . ,Xn) = M{u*,b). 
xi,...,xneD beSi^k 

Recall the tuples u'*'^'] and v^*'^'^!, b G <fj fc> which we used in the construction of ti{-). Because M is block-rank-1 
and because u* and ul* '^! are known to belong to the same block of M, we have 



However, by the definition (|71) of tj(-), we have 



since we assumed that Uj G f j fc. As a result, we have 



tiiUi) 



Xi,...,x„€D be£,,fe \iG[j-l] / \j£[i:n] ^' 

The last equation follows from This finishes the construction of t„, . . . , t2 and the proof of Lemma[6l 
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6 Decidability of the Dichotomy 



In this section, we prove Theorem |5]by showing that the decision problem is in NP. 

By Theorem|4]we need to decide, given D and T, whether F is strongly rectangular and T is weakly balanced 
or not. The first part can be done in NP ||4l|20l by exhaustively seaixhing for a Mal'tsev polymorphism. 

Lemma 11 (H1 I20II ). Given T, deciding whether it is strongly rectangular is in NP. 
6.1 Primitive Balance 

Next we show the notion of weak balance is equivalent to the following even weaker notion of primitive balance: 

Definition 5 (Primitive Balance). We say T is primitively balanced if for any instance I of {D, T) and the n-ary 
function Fj{xi, . . . , x„) it defines, the following dx d matrix M/ is block-rank-1: The rows o/M/ are indexed by 
xi £ D and the columns are indexed by X2 G D, and 

Mi{xi,X2)= ^ Fi{xi,X2,X3, . . . ,Xn), for all xi, X2 £ D. (8) 

X3,...,x„GD 

It is clear that weak balance implies primitive balance. The following lemma proves the inverse direction: 
Lemma 12. IfV is primitively balanced , then it is also weakly balanced. 

Proof. Assume for a contradiction that F is not weakly balanced. By definition, this means there exist an / over 
n- variables and an integer a : 1 < a < n such that the following d"- x d matrix M is not block-rank-1: the rows of 
M are indexed by u G D"^ and the columns are indexed by v £ D, and 

M{u,v) = ^ Fj{u,v,w), for all u G and i; e 

As a result, we know by Lemma |2] that A = M^M is not block-rank-1. 

To reach a contradiction, we construct /' from / as follows: /' has 2n — a variables in the following order: 

Xl,X2,yi, ... ,ya,Zi,... ,Zn-a-l,Wi, . . . ,Wn-a-l- 

The instance /' consists of two parts: a copy of / over {yi, . . . ,ya, xi, zi, . . . , Zn-a-i) and a copy of / over (yi , . . . , 
ya, X2,wi, . . . , Wn-a~i)- Let Fji denote the function that /' defines. It gives us the following d x d matrix M//: 

Mji{xi,X2)= ^ Fj{y,xi,z) ■ Fj{y,X2,w) = M{y,xi) ■ M{y,X2) = A{xi,X2), 

yGD^.z.wS-D"-"-! ySD" 

which we know is not block-rank-1. This contradicts with the assumption that F is primitively balanced . □ 

Now the decision problem reduces to the following, and we call it 

PRIMITIVE BALANCE: Given D and F such that F is strongly rectangular (which 
by LemmafTTlcan be verified in NP), decide whether F is primitively balanced. 
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Since T is strongly rectangular, we know that for any input I of {D,T), the d x d matrix M/ defined in dUl must 
be rectangular. We need the following useful lemma from |[20l . which gives us a simple way to check whether a 
rectangular matrix is block-rank- 1 or not. 

Lemma 13 ([20|). A rectangular d x d matrix M is block-rank-1 if and only if 

M{a, KfM{/3, \fM{a, X)M{p, k) = M{a, XfM{p, KfM{a, k)M{(3, A) (9) 

for all a ^ (3 ^ D and k ^ \ G D. 

As a result, for PRIMITIVE BALANCE it suffices to check whether Q holds for M/, for all instances / and for 
all a / /3, K 7^ A G Z?. In the rest of this section, we fix a 7^ /3 € Z) and k / A G D, and show that the decision 
problem (that is, whether Q holds for all /) is in NP. Theorem [5] then follows immediately since there are only 
polynomially many possible tuples (a, /3, k, A) to check. 

6.2 Reformulation of the Decision Problem 

Fixing a ^ 13 G D and k ^ X G D, we follow ll20l and reformulate the decision problem using a new pair 
that is, the 6-th power of {D,T): 

1. First, the new domain D = D^, and we use s = (si, . . . , sg) to denote an element in D, where Sj G D. 

2. Second, ^ = {gi, . . . , gh} has the same number of functions as T and every gi, i G [h], has the same 
arity rj as /j. Function g^ : D^^ — )• is constructed explicitly from fi as follows: 

5i(si,. . .,Sr,) = Y{ fii^hj^ ■ ■ • for all Si, . . . ,5r, G D = -D*^. 

i6[6] 

In the rest of the section, we will always use Xi to denote variables over D and yi, Zi to denote variables over D. 

Given any input instance / of {D, T) over n variables (xi, . . . , it naturally defines an input instance 3 of 
(S, ^J) over n variables (yi, . . . , y„) as follows: for each tuple (/, ii, . . . , ir) G /, add a tuple {g,ii, . . . , ij.) to 3, 
where g £ ^ corresponds to f £ T. Moreover, this is clearly a bijection between the set of all / and the set of all 
3. Similarly, we let G : S?" — ?• M+ denote the ?i-ary function that 3 defines: 

G{yi,...,yn)= Y\. aiVii,- ■ ■ .Vir)^ for all yi, . . . , y„ G £>. 

(g,il,...,ir)G3 

The reason why we introduce the new tuple (2) , 5^) is because it gives us a new and much simpler formulation of 
the decision problem we are interested. 

To see this, we let a, b, c denote the following three specific elements from T): 

a = {a,a,a, f3, f3, f3), b = {k, k, X, X, X, k), c = {X, X, k, k, k, X). 

Since a ^ fi and k / A, 0, b, c are three distinct elements in T). We adopt the notation of ||20]| . For each s G S, let 

hovas{3) =* ^ G{a,B, 2/3, ... , for every instance 3 of (S, i?). 
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It is easy to prove the following two equations. Let 3 be the instance of (T),^) that corresponds to /, and M/ be 
the d X d matrix as defined in Then 

hombp) = M/(a, k)^Mi{I3, XfMi{a, X)Mi{l3, k) and 
hom^p) = Mj{a,X)^Mj{l3,K)^Mi{a,K)Mi{P,X) 

As a result, we have the following reformulation of the decision problem: 

Mj satisfies © for all / <;=^ homa(a) = hom,(a) for all 3 

The next reformulation considers sums over injective tuples only. We say (yi, . . . , y„) G is an injective 
tuple if yi 7^ yj for all i ^ j ^ [n] (or equivalently, if we view (yi, . . . , y„) as a map from [n] to D, it is injective). 
We use Yn to denote the set of injective n-tuples. (Clearly this definition is only useful when n < otherwise 
Yn is empty.) We now define functions mong(3), which are sums over injective tuples: For each s G 2), let 

mons(D') =^ ^ G(o, s, 7/3, ... , ?/„), for every instance 3 of (D, 5^). 

{a,s,y3,...,y„)£Y„ 

The following lemma shows that hom(,(Di) = homc{3) for all 3 if and only if the same equation holds for the 
sums over injective tuples. The proof is exactly the same as Lemma 41 in |[20l . using the Mobius inversion. So we 
skip it here. 

Lemma 14 ([20|, Lemma 41). hom[,(J) = hom^{3)for all 3 if and only if monb{3) = mon^{3) for all 3. 

Finally, the following reformulation gives us a condition that can be checked in NP: 

Lemma 15. mont, (3) = mon,; (3) for all 3 if and only if there exists a bijection vr from the domain to itself 
{which we will refer to as an automorphism /rowi {T)^^) to itself) such that 7r(a) = '/r(a), vr(b) = ■7r(c), and for 
every r-ary function g £ ^, we have 

g{yi,. . . ,yr) = g(TT{yi),. . . ,Tr{yr)^, for all yi, . . . ,yr e D. (10) 

Proof. We start with the easier direction: If vr exists, then mon(,(J) = monc(3) for all 3. This is because for any 
injective n-tuple (a, b, 2/3, ... , y„) G Yn, we can apply vr and get a new injective n-tuple (a, c, vr(y3), . . . , vr(y„)) G 
Yn and this is a bijection from (a, b, 1/3, ... , y„) G Yn and (a, c, Z3, . . . , z„) G Yn- Moreover, by (flOl ) we have 

G(a, b,i/3,. . .,?/„) = G(a, c,7r(?/3), . . . ,7r(y„)). 

As a result, the two sums mont,(3) and mon^{3) over injective tuples must be equal. 

The other direction is more difficult. First, we prove that if mont,(J) = monc(3) for all 3, then for any 3 and 
any tuple (a, b, ^3, . . . , ?/„) G Yn with G(o, b, ^3, . . . , > 0, there exists a (a, c, Z3, . . . , z„) G Yn such that 

G{a,b,y3,. . . ,yn) = G{a,c,Z3, . . . ,Zn). (11) 
To prove this we look at the following sequence of instances 5i = ^,^2, ■ ■ ■ defined from 3, where 2j consists of 
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exactly j copies of J over the same set of variables. We use Gj to denote the n-ary function that 5j defines, then 

Gjivi,. . . ,yn) = (G(yi, . . . ,?/„))^ for all yi, . . . , ?/„ G D. 

Let Q = {qi, . . . , q\Q\} denote the set of all possible positive values of G over Yn, let ki > denote the number 
of tuples (a, b, ys, . . . , y„) E Yn such that G(o, b, ys, . . . , y„) = Qi, i G let > denote the number of 

tuples (a, c,y3, ...,?/„) G y„ such that G{a, c,y3, . . . , y„) = g^, i G [IQj]- Then by mon(,(aj) = mon^pj), 

«G[|Q|] jG[|Q|] 

Viewing ki — £{ as variables, the above equation gives us a linear system with a Vandermonde matrix if we let j go 
from 1 to \Q\. As a result, we must have ki = £i for all i G [\Q\], and ([TT]) follows. 
To finish the proof, we need the following technical lemma: 

Lemma 16. Let Q be a finite and nonempty set of positive numbers. Then for any k >\, there exists a sequence 
of positive integers Ni, . . . , such that 

Qi'Q2' ■■■%' = il'if' ^2^' ■ ■ ■ Wk)''' , where qi, . . . ,qk,q[ . . . ,q'k e Q (12) 
if and only if qi = q[ for every i £ [k]. 

Proof. The lemma is trivial if \Q\ = 1, so we assume \Q\ > 2. We use induction on k. The basis is trivial: we just 
set A'^i = 1. Now assume the lemma holds for some k > I, and A'^i, . . . , Nk is the sequence for k. We show how 
to find A^/c+i so that A'^i, . . . , Nj^^i satisfies the lemma for fc + 1. To this end, we let 

Cmin = min q/q' > I and c^ax = max q/q'. 

q>q'€Q q>q'eQ 

Then we let Nk+i be a large enough integer such that 

(c ■ )^''+^ > (c "l^'SW^' 

To prove the correctness, we assume ([T2l) holds. First, we must have qk+i = Qk+i- Otherwise, assume without 
generahty that q^^i > g^^^, then by (fT2l ) 

which contradicts with the definition of Nk-{^i. Once we have q^+i = g^+i, they can be removed from (fT2l) and by 
the inductive hypothesis, we have q^ = q'- for all i £[k]. This finishes the induction, and the lemma is proved. □ 

To find vr, we define the following 3. It has variables and we denote them by y^, s G !D. (In particular, ya 
and ?/(, are the first and second variables of 3 so that later mons(D') is well-defined.) Let L be the set of all tuples 
((7,Si, . . . ,5,.), where g is an r-ary function in ^ and ^(si, . . . ,Sr) > 0. We let A'^i, . . . , A^|^| be the sequence of 
positive integers that satisfies Lemma [T6] with k = \L\ and 

Q = • • • ,Sr.) : (c/,5i,...,Sr.) G lI. 
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Then we enumerate all tuples in L in any order. For the ith tuple (51,51, ... ,Sr) € L,i € [[L|], we add Ni copies 
of the same tuple (5,31, ... ,5,.) to 3. This finishes the definition of 3. 

From the definition of 2, it is easy to see that G{ys : ye = s for all s € > 0. Therefore, by (fTTT) we know 
there exists a tuple (zg : s € D) € l^n such that Za = a, = c, and 

: ys = 5 for all 5 G D) = G{zs : s G D) > 0. 

def 

We show that 7r(s) = z^, for every s € 2), is the bijection that we are looking for. 

First, using Lemma[T6l it follows from the definition of 3 that for every tuple ((7, si , . . . , Sr) G L, we have 

g{5i,.. . ,5r) = 9(vr(si), . . . ,7r(Sr)). 

So we only need to show that 5(7r(si), . . . , vr(Sr.)) = whenever g{5i, . . . ,5^) = 0. This follows directly from the 
fact that vr is a bijection and thus, (si, . . . ,Sr) (7r(si), . . . , 7r(Sr.)) is also a bijection. □ 

With Lemma [14] and Lemma [T5l we only need to check whether there exists an automorphism vr from (Tl,^) 
to itself such that 7r(a) = a and 7r(b) = c. We can just exhaustively check all possible bijections from Tl to itself, 
and this gives us an algorithm in NP. 



7 Proof of Lemma |3] 

Let / be an input of {D, T) with n variables x = {xi , . . . , Xn) and m tuples, and R be the relation it defines. 

For each A; > 1, we let Ik denote the following input of {D, T): Ik has n variables (xi, . . . , x^); and for each 
(0, ii, . . . , ir) G /, we add k copies of (/, ii, . . . , ir) to Ik, where / G J-" is the r-ary function that corresponds to 
G r. We use i*fc(x) to denote the n-ary non-negative function that Ik defines. Then it is clear that 

Ffc(x) = (Fi(x))'', for all x G D". (13) 



We will show that to compute \R\, one only needs to evaluate Z{Ik) for k from 1 to some polynomial of m. This 
gives us a polynomial-time reduction from (D, F) to {D, T). 
Now we let Qm denote the set of all integer tuples 



q = ^q-^t > : i G [/i] and t G D""' such that /j(t) > 
that sum to m. And let VALUEm denote the following set of positive numbers: 



Value^ 



It is easy to show that both \Qm \ and jVALUEmj are polynomial in m (as d, h and ri, i £ [h] are all constants) and 
can be computed in polynomial time in m. Moreover, by the definition of Value^ we have for every x G 

Fi(x) > =^ Fi(x) G VALUE^. 
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For every c € VALUEm, we let denote the number of x £ L)" such that -Fi(x) = c. Then we have 

Z{h)= J2 ^--c (14) 



ceVALUE„ 

We also have 



\R\= (15) 



cGVALUEm 

and by ^ 

Z{Ik) = ^c-d", for every k>l. (16) 

CgVALUEm 

If we view {Nc : c € VALUER } as variables, then by taking /c = 1, . . . , |VALUEm|, (fT6l ) gives us a Vandermonde 
system from which we can compute Nc, c G Valuer, in polynomial time. We can then use (ITSl) to compute 
This finishes the proof of Lemma [3l 



8 Proof of Lemma |5] 

Assume that F is not balanced. Then by definition, there exists an input instance I for {D, T) such that 

1. It defines an n-ary function F{xi, . . . , x^); and 

2. There exist integers a,h : \ < a < h < n such that the following d"- x (f'"- matrix M is not 
block-rank- 1: the rows are indexed by u G and the columns are indexed by v G D^~°-, and 

M(u, v) = F{u, V, w), for all u G and v G D'''". 

Because M is not block-rank-1, by Lemma|2j it has two rows that are neither linearly dependent nor orthogonal. 
We let M(ui, *) and M(u2, *) be such two rows, where ui, U2 G D"^. Then 

0< (M(ui,*),M(u2,*)>' < (M(ui,*),M(ui,*)>-(M(u2,*),M(u2,*)). (17) 

We let A = MM^, which is clearly a symmetric and non-negative d"" x matrix, with both of its rows and 
columns indexed by u G D". It then immediately follows from ([TT] ) that A is not block-rank-1, since all the four 
entries in the {ui, U2} x {ui, U2} sub-matrix of A are positive but this 2x2 sub-matrix is of rank 2 by ([17] ). 

To finish the proof, we give a polynomial-time reduction from Za{-) to {D, F). Because the former is #P-hard 
by Theorem [2] (since A is not block-rank-1), we know that {D, F) is also #P-hard. 

Let G = {V, E) be an input undirected graph of Zx{-). We construct an input instance Iq of (Z?, F) from G, 
using I (which is considered as a constant here since it does not depend on G), as follows. 

1. For every vertex f G F, we create a variables over D, denoted by Xy^i, . . . , x^^a', and 

2. For every edge e = ' G E, we add (6 — a) + 2{n — b) variables over D, denoted by 

ye,a+l ) ■ ■ ■ ; Vefii Ze,b+1 Ze,m ^e,&+l ) ■ ■ ■ ; ^e,n- 
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Then we make a copy of / over the following n variables: 



(^•^v,li ■ ■ ■ 1 ■^Vjai Ue.a+l t • • • i Vcjbi ^e,b+l > • • • i ^e.n) 

as well as the following n variables: 

(•^u',! ) • • • ) ■^v' ,ai ye,a+l i • • • 5 ye,bi ^e,fe+l ' • • • 5 ^e,n) ' 

This finishes the construction of Iq- 

It is easy to show by the definitions of M and A above that Za{G) = Z{Ig)- This gives us a polynomial-time 
reduction from problems Za{-) to {D, T) since Iq can be constructed from G in polynomial time. 

9 Equivalence of Balance and Strong Balance 



In EH Dyer and Richerby used the following notion of strong balance for unweighted constraint languages F and 
showed that (D, T) is in polynomial time if T is strongly balanced; and is #P-hard otherwise. 

Definition 6. Let T be an unweighted constraint language over D. We call T strongly balanced if for every input 
instance I of {D, T) {which defines an n-ary relation R) and for any a,b,c : 1 < a < b < c < n, the following 
d"' X d^~"' matrix M is block-rank-1: the rows are indexed by u £ and the columns are indexed byvG D^~"', 

M(u,v)= {w e D"-'' -.Bz e D''"'' such that {u,v,w,z) e R} , for all vl ^ and w D^"" . (18) 

There are two special cases. When c = b, M(u, v) is 1 if there exists a z G D"'~^ such that (u, v, z) G R; and is 
otherwise. When n = c, M(u, v) is the number ofw G D'^^^ such that (u, v, w) G R. 

Theorem 6. {D, T) is in polynomial time ifV is strongly balanced; and is ffP-hard otherwise. 

Notably the difference between the notion of balance we used for weighted languages F (Definition and the 
one above for unweighted languages F ||2n is that we do not allow the use of existential quantifiers in the former. 
One can similarly define the following notion of balance for unweighted F: 

Definition 7. Let F be an unweighted constraint language over D. We call F balanced if for every instance I of 
{D, F) {which defines an n-ary relation R) and for any a,b : 1 < a < b < n, the following x d^~"' matrix M 
is block-rank-1: the rows are indexed by u £ Z?" and the columns are indexed by v G D^~'^, 

M(u,v) = {w G Z)"-'' : (u,v,w) G it:} , for all u £ D" and w e D^'" . (19) 

We show below that these two notions, strong balance and balance, are equivalent. 

Lemma 17 (Equivalence of Balance and Strong Balance). IfT is balanced, then it is also strongly balanced. 

Proof. We assume that F is balanced. Let / be any instance of {D, F) which defines an n-ary relation R. Let a, b 
and c be integers such that l<a<6<c<n. It suffices to show that the matrix M in (fTSl ) is block-rank-1. 
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For this purpose, we define a new input instance 1^ of F) for eacli k > 1: 

1. First, Ik has c + k{n — c) variables in the following order: 

Xi, . . . , Xc, yi,c+l) • • • ) yi,ni • • • ) yk,c+li ■ ■ ■ ) yk,n- 

Below we let y^, i € [k], denote {yi^c+i, • • ■ , yi,n) for convenience. 

2. For each i € [k], we add a copy of / on the following n variables of I^: xi, . . . , Xc, yi,c+i, ■ ■ ■ , yi,n- 

It is clear that Ii is exactly /. We also use Rk to denote the relation that Ik defines, k > 1. 

Because F is balanced, the following d"" x d''"'^ matrix M.^^^ is block-rank-1: For u € D'^ and v € D''""', 

mW(u,v) = {(w,yi,...,yfc) : w G yi, . . . , y^ G D""^ and (u, v, w,yi, . . . ,yfc) G i^fc} . 

From the definition of Ik, we have M(u, v) > if and only if Afl''] (u, v) > 0, for all u G -D" and v G D^~°-. 

Therefore, there exist pairwise disjoint and nonempty subsets of D", denoted Ai, . . . , As, and pairwise disjoint 
and nonempty subsets of Z)^^", denoted Bi, . . . , Bg, for some s > 0, such that 

M(u, v) > Afl^l(u> v) > u G and V G Bf, for some £ G [s]. 

Now to prove that M is block-rank-1, we only need to show that for every G [s], 

M(ui, vi) • M(u2, V2) = M(ui, V2) • M(u2, vi), for all ui, U2 G Ai and vi, V2 G B^. (20) 

To prove (l20l) . we let 

Wij = |w G D""-^ : 3y G such that (u^, v^, w, y) G i?}, fori,j G {1,2}. 

Furthermore, for every w G Wij, we let j,w denote the (nonempty) set of y G D^~^ such that (uj, Vj , vi^, y) G 
R. Now using j and ^j,j,w> it follows from the definition of 1^ that 

Because M''""] is block-rank-1, we have the following equation for every k > 1: 

(|>l,l,w| • |>2,2,w'|)'= (|>'l,2,w|•|l2,l,w'|)^ 

weH/'i,i,w'eW2,2 wGWi,2,w'eH/'2,i 

Since the equation above holds for every k > 1, the two sides must have the same number of positive terms. By 
definition, we have li,j,w is nonempty for all w G Wij. As a result, we have 

\Wi,i\ ■ \W2,2\ = \Wl,2\ ■ \W2,l\ 

and (I20I ) follows. This finishes the proof of Lemma[T7] □ 
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